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Abstract 


Using  the  idea  of  maximum  likelihood,  we  derive  an  estimator  for 
a  distribution  function  possessing  an  increasing  (decreasing)  failure 
rate  and  also  obtain  corresponding  estimators  for  the  density  and  the 
failure  rate.  We  show  that  these  estimators  are  consistent. 


Maximum  Likelihood  Estimation  for  Distributions 
with  Monotone  Failure  Rate 

Albert  W.  Marshall  and  Frank  Proschan 
Boeing  Scientific  Research  Laboratories 

1.  Introduction.  Given  a  set  of  observations  X^,...,Xn  from  a 
common  distribution  function  F,  it  is  natural  in  the  absence  of  additional 
information  to  estimate  F  by  the  usual  empirical  distribution  function. 
However,  one  would  not  use  this  estimator  if  there  were  at  hand  sufficient 
a  priori  information  about  the  distribution  F,  e.g. ,  that  F  is  a  member 
of  a  given  parametric  class  such  as  the  normal.  In  this  paper,  we  examine 
an  intermediate  case ,  the  case  that  F  is  known  to  have  increasing 
(decreasing)  failure  rate.  Using  the  idea  of  maximum  likelihood,  we  derive 
an  estimator  for  F  which  itself  has  increasing  (decreasing)  failure  rate, 
and  also  obtain  estimators  for  the  density  and  failure  rate.  These  estima¬ 
tors  are  shown  to  be  consistent. 

2.  Properties  of  IFR  distributions,  and  formulation  of  the  problem. 

The  failure  rate  r  of  a  distribution  F  having  density  f  is  defined 

by  r(x)  =  f(x)/[l  -  F(x)],  for  F(x)  <  1.  It  is  easy  to  verify  that  if 

x 

r  is  increasing,  then  J"  r(z)dz  =  R(x)  =  -logCl  -  F(x)]  is  convex  on  the 

0 

support  of  F,  an  interval.  (Throughout  this  paper  we  write  "increasing" 
for  "nondecreasing"  and  "decreasing"  for  "nonincreasing.")  Whether  f 
exists  or  not,  we  say  that  F  has  increasing  failure  rate  (IFR)  if  the 


2 


support  of  F  is  of  the  form  [a,p]  -°°  <  a  <  p  <  ”,  and  if  R  is  convex 
on  [a,p).  The  importance  of  the  IFR  property  and  its  applications  to 
life  testing  and  reliability  are  discussed  in  [3,4]. 


The  continuous  part  of  an  IFR  distribution  F  is  absolutely  continuous. 

To  see  this,  choose  e  >  0,  z  such  that  R(z)  <  ”,  and  points 

m 

<  p^  <  a2  <  <  *  *  ‘  <  am  <  0m  —  z  satis:fyirig  £  ( p^  -  ou)  <  e/r+(z), 

where  r  (z)  =  lim  [R(z  +  &)  -  R(z)]/6  exists  finitely  since  R  is  convex. 
610 


Then 


m  m  R( p. )  -  R(a. )  m 

E  |R(P. )  -  R(a.  )  |  =  £  - ^ - - — —  (p.  -  a  )  <  r+(z)£  (  P .  -  a,  )  < 

2  x  1  ^  Pi_ai  xi—  j  i  i  ~ 

Thus  R  is  absolutely  continuous  on  (-«>,z),  and  the  result  follows. 


For  convenience,  if  F  is  IFR  we  define  r(x)  =  <*>  for  all  x  such 

that  F(x)  =  1.  Note  that  for  any  distribution  F  and  any  x  for  which 

r  is  defined  on  (~”,x),  we  have 

x 

(2.1)  1  -  F(x)  =  exp[-  J  r( z)dz] . 

—00 

Further,  properties  of  IFR  distributions  have  been  discussed  in  [2], 


Let  0  be  the  class  of  IFR  distributions ,  and  let  X.  <  X_  <  • • •  <  X 

be  obtained  by  ordering  a  random  sample  from  an  unknown  distribution  F  in 

0,  It  is  not  possible  to  obtain  a  maximum  likelihood  estimator  for  F  «  0 

n 

directly  by  maximizing  TT  f(X. )  ,  since  for  F  «  0,  f(Xn)  can  be  arbitrarily 

_h 

large.  Consequently,  we  first  consider  the  subclass  0  of  distributions 
F  in  0  with  corresponding  failure  rates  bounded  by  M,  obtaining 

sup  TT  f(X. )  <  M11.  We  shall  see  that  there  is  a  unique  distribution  Fr  in  0 
F^1  1 


I 


at  which  the  supremum  is  attained.  The  conventional  maximum  likelihood 

estimators  F^  for  £7^  converge  in  distribution  as  M  -*  »  (i.e. ,  as 

to  an  estimator  Fn  €  £7  which  we  call  maximum  likelihood  for  g'. 

Furthermore,  the  density  and  failure  rate  r^  of  F*  converge  in  a 

’  n  n  n 

natural  way  to  the  density  f  and  failure  rate  ?n  (of  the  continuous 
part)  of  F^  as  is  shown  below  in  Section  3. 

3.  Derivation  of  the  estimators.  From  (2.1)  we  obtain  that  the  log 

M 

likelihood  L  =  L(F)  is  given,  for  F  e  £7  ,  by 

(3.D 


X. 

n  n  i 

L  =  2  log  r(X.)  -  £  J"  r(z)dz. 
1  1  1 


L  is  maximized  over  £7^  by  a  distribution  with  failure  rate  constant  between 

fM  . _  _ _ _  n  « 


observations ,  as  may 

be  seen 

as  follows :  Let 

F  e 

and  let 

F*  be  the  distribution  with  failure 

rate 

0  , 

x  <  Xx 

(3.2) 

r*(x)  = 

r(xi) , 

X.  <  x  <  X.  .  , 
i  —  l+l 

i  =  1 

r(Xn) , 

x  >  X 
—  n 

X. 

1 

Then  F* 

e  and  r(x)  >  r 

*(x)  so  that 

L r{ 

for  all 

i;  we  conclude  that 

L(F)  <  L(F*)  . 

Thus , 

X. 
_  1 


the  function 


n-1 


(3.3) 


E  log  r(X±)  -  E  (n  -  i)(Xi+1  -  Xi)r(Xi), 


1  1 

The  procedure  for  maximization  of  (3.3)  subject  to  r(X^)  <  <  r(^n)  =  ^ 

can  be  obtained  as  a  direct  application  of  [6,  Corollary  2.1  and  the  discussion 
following]  (see  also  [10,  11]).  This  procedure  yields  for  r  (corresponding 
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to  F  e £7^)  the  estimator 
(3.4) 


r  (X.)  =  min  max  { -  [r  +  •*•  +  r  .]} 

n  x  _  v-u  u  v-1 

V>1+1  U<1 


where 


(3.5) 


r.  =  C(n  -  j)(Xj+1-X.)]_1,  j  =  1,2 n-1 ,  rR  =  M  , 

with  the  convention  that  r..  =  M  when  [(n  -  j)  ( X^.+^  -  X  . ) ]  ^  >  M. 

The  maximization  procedure  which  yields  (3.4)  may  be  described  as 
follows.  First,  find  the  maximum  of  (3.3)  restricted  only  by  r(x)  <  M, 
obtaining  (3.5).  If  there  is  a  reversal,  say  r^  >  then  set 

r(X^)  =  r(Xi+1)  in  (3.3)  and  repeat  the  procedure.  After  at  most  n 

steps  of  this  kind,  a  monotone  estimator  is  obtained.  The  maximum  derived 

with  r(X^)  =  r(Xi+1)  can  be  directly  obtained  by  replacing  r^  and  ri+^ 

Succeeding  steps  amount  to  further 
such  averaging  which  is  extended  just  to  the  point  necessary  to  eliminate 
all  reversals.  It  can  be  seen  that  this  is  exactly  what  is  called  for  in 


by  their  harmonic  mean,  (r.^  +  r.1,  )  1 
J  *  l  l+l 


(3.4).  (In  this  connection,  see  also  [1,5].)  The  resulting  estimator  r 


M 


is  of  the  form 


rn(x)  = 


0, 


x  <  X1 


r  ,  ,X  .  <  x  <  X 

n.  +1  ,n.  ,  ’  n.  +1  —  n 

i  ’  i+l  i 


+1 


M, 


x  >  X 

—  n 


where  r.  <  r  < 

1 ini  “  ni+l >n2  ~ 

and  r 


<  r 


"k 


+1 ,n-l 


i+l 


,  0  =  n  <  n  < 


<  \<  n-1. 
Of 


n^+1,^  ^  is  the  harmonic  mean  of  rn  +j>  rn  +2’””  rn 

i+  x+  i+l 

course,  the  n^  are  determined  by  the  rule  which  determines  the  extent  of 


i 
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the  averaging. 

The  estimator  for  r  corresponding  to  F  e  9  is  obtained  by  letting 
M  -*  ■»  in  (3.4),  and  is  given  by 

(3.6)  r  (X.)  =  min  max  { — - —  [(n-u)(X  .  -X  )  +  •••  +  (n-v  +  l)(X  -X  ..]) 

ni  ..  v  -  u  u+1  u  v  v-1  ’ 

V>1+1  U<1 

i  =  l,2,...,n-l  and  r^CX^)  =  “•  For  the  remaining  values  of  x,  ?  (x)  is 

determined  by  (3.2)  with  replacing  r  and  r*  .  The  corresponding 

estimators  F  and  f  for  F  and  f  are  obtained  from  r  using  (2.1) 

n  n  n  ^ 

and  the  relation  f  (x)  =  ?  (x)[l  -  F  (x)). 

n  n  n 

It  is  of  interest  to  note  that  the  estimator  r  can  also  be  written 

n 

in  the  form 

v  .. 

(3.7)  ?n(x)  =  inf  sup  {  J  Cl  -  Fn(y)]dy/[Fn(v)  -  F^u)]}--1 

v>x  u<x  u 

y 

where  F^  is  the  empirical  distribution.  Similarly,  when  r(x)  is  increasing, 
it  is  given  by  (3.7)  with  F  replacing  F^. 

4.  Consistency.  In  case  we  restrict  ourselves  to  distributions  with 

support  contained  in  [0 ,<»)  ,  it  is  not  difficult  to  verify  that  the  regularity 

conditions  used  in  [8]  are  satisfied  by  the  family  0  .  Thus  from  the  results 

of  [8],  it  follows  that  F^(t)  is  a  consistent  estimator  of  f(t)  for  this 

restricted  family.  For  fixed  t  <  p,  choose  M  >  r(t);  then  it  follows  that 

f^(t)  =  f  (t).  We  conclude  that  f  (t)  is  a  consistent  estimator  of  f(t) 
n  n  n 

for  the  family  &  of  IFR  distributions  F  satisfying  F(0)  =  0.  However, 

rather  than  verify  the  regularity  conditions,  we  choose  to  give  a  direct 
proof  of  consistency.  In  so  doing,  we  avoid  the  question  of  whether  or  not 
the  regularity  conditions  are  satisfied  when  F(x)  >  0  for  all  x. 
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Theorem  4.1.  If  r  is  increasing,  then  for  every  t^, 

(4.1)  r(t_.-)  <  lim  inf  r  (t  )  <  lim  sup  r  (t  )  <  r(tn+) 

O  —  n  U  —  n  u  ■—  v 

with  probability  one. 


Proof.  The  right-hand  inequality  is  trivial  if  r(t^+)  =  otherwise, 
let  t^  >  t^  satisfy  r(t^)  <  ®,  and  let  a  (n)  +  1  be  the  index  of  the 
largest- observation  <  t ^ ,  j  =  0,  1.  Let'  N^(n)  and  N^Cn)  be  defined  by 


*n(V  = 


r  i  N^(n)  r1 

=  [  N2(n)  -  Nl(n)  i=Nifn)+1(n  -  l)(Xi+l  -  Vj  • 


-r(t. )y 

Let  Y  =  -[r(t  )]  log[l  -  F(X)] ,  so  that  p[y  >  y}  =  P{l  -  F(X)  <  e  } 

-r(t1)y 

=  e  ,  i.e.  ,  Y  has  an  exponential  distribution.  Since  the  X^  are 

order  statistics  from  the  distribution  F,  Y^  =  ~[r(t^)]  1  logtl-FCX^)] 
are  order  statistics  from  the  exponential  distribution,  and  ( n’-  i) ( Yi+1  -  Y^ 
are  independent ,  identically  distributed  exponential  random  variables ,  with 
mean  l/r(t^).  Finally, 

X.  ,  X.  X.  , 

.  l+l  i  l+l 

(4.2)  Yi+i  “  Yi  =  Cr( 't-1)3  XCJ  r(z)dz  -  f  r(z)dz]  =  J*  [r(z)/r(t;L)]dz 


<  X.+1  -  X.  ,  i  <  a1(n). 


From  (3.6)  and  (4.2),  it  follows  that 
_  an(n) 


r  i  1  i 

(O  <  — J  I ■  „  /  "V  £  (n  -  i)  (X.  ,  -  X.) 

0  La1(n)  ”  Ni<n)  i=N1(n)+l  1+1  1  J 

r  i  ai(n)  V 

La1(n)  "  Ni(n)  i=N, (n)+l  1+1  1  J 


But 
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a.  (n) 


r  L  v1 

— T  T~~m~  7  s  2  ( n  -  i)  ( Y.  -,-Y.)|  =  r(tn) 

1^(1.) -Nl(n)  1=  (Q)+1  i+l  ij  1 


with  probability  one ,  by  the  strong  law  of  large  numbers 

(lim  ta^(n)  -  N^n)]  >  lim  [a^Cn)  -  a^(n)]  =  <=  with  probability  one), 
n  n 

We  conclude  that  lim  sup  r  (t  )  <r(t^)  with  probability  one,  and  the  right- 
hand  inequality  of  (4.1)  follows.  A  similar  proof  yields  the  left-hand 
inequality.  || 

Corollary  4,2.  If  r  is  increasing,  then  for  all  t,  lim’ F  (t)  =  F(t) 

n-«> 

with  probability  one. 

Proof.  It 'is  sufficient  to  prove  the  theorem  for  t  satisfying 

F(t)  <  1,  in  which- case  Fn(t)  <1  for  sufficiently  large  n.  By  Theorem 

4.1,  lim  ?  (z)  =  r(z)  except  possibly  for  z  in  a  set  of  Lebesgue  measure 
n-*°° 

zero.  For  z  e[x,t],  x  >  -°° ,  r  (z)  <  and  by  the  Lebesgue  dominated 

t  "  t 

convergence  theorem,  lim  J*  r  (z)dz  =  J  r(z)dz  with  probability  one.  Then, 


by  (2.1), 


(4.3)  lim 


1-Fn;(tl  l-F(t) 


n-4co  1  -  F  (x) 
n 


- - — — f  with  probability  one. 

l-F(x) 


If  we  knew  that  F(x)  =  0  for  some  x  >  this  would  complete  the  proof. 

x 

In  order  to  obtain  an  upper  bound  for  J  rn(z)dz,  we  first  note  that 


r  n  =  [7 - TY~? - Trt  ( n-(n.  +1)  )  (X  )  +  ...  +  (n-n .  ,)(X  .,-X  >]) 

ni+1’ni+l  (ni+l+1)“(ni+1)  1  V2  1+1  ni+l+1  "i+l 


<  {7 - - r-r-[(n-n.  ,)(X  ~-X  , )  +  •  • *+(n-n.  ,)(X  ,-X  )]} 

-  (ni+1+l)-(ni+l)  i+l  n±+2  ni+l  l+l  ni+1+l  ni+x 


ni+l""ni 

(n-ni+i)Un  "-Xn  +1) 

i+l  1 


Let  k  =  k(n)  be  the  index  of  the  largest  observation  not  greater  than  x; 


if  X.  is  in  [X  ,  ,X  ,),  we  obtain  by  (3.6), 
k  n  .+.1  n  .  +1 

0  J+l 


Vu,„. ,  S  1 

3  3+1  3  3  3 


<  j  i  [(n_k)(xk+1-xn  +1))}  \ 

j  o . 


From  these  estimates,  it  follows  that 
x  0-1 


T  r  (z)dz  <  S  r  .  ‘(X  ,-X  ,)+r  ,i  „  ty  y  \  ^ 

•I .  n  "  i=0  ni+1’ni+l  ni+l+1  ni+1  nj+ 1,1 Vl(Xk+l  -  n-k  • 

1  k  F(x) 

If  O  <  e  <  —  and  x  satisfies  F(x)  <  e  ,  then  lim  ^  _  F(x) 

<  -  6  -  <  2e  with  probability  one,  so  that  lim  sup  J*  r  (z)dz  <  2e ,  and  by 
1  ~  6  _00  “ 

(2.1),  lim  inf[l  -  Fn(x)]  >  e~2e  >  1  -  2e.  This  together  with  (4.3)  completes 
the  proof.  (| 

Corollary  4.3.  If  r  is  increasing  and  continuous  on  [a,b],  then 


(i) 

lim 

sup 

Ir  (t) 

-  r(t)|  =  0 

n-«ao 

te[a,b] 

XI 

(ii) 

lim 

sup  1 

Fn(t)  ■ 

-  F( t) |  =  0,  . 

n-*» 

-«<  t<°° 

(iii) 

lim 

sup 

If  (t) 

-  f(t)|  =  0, 

n-«° 

te [a ,b] 

XI 

each  with  probability  one. 


Proof,  (i)  and  (ii)  follow  from  the  same  methods  as  in  the  usual  proof 


of  the  Glivenko-Cantelli  theorem,  (iii)  follows  from  (i) ,  (ii),  and  the  fact 
that  f(t)  =  r(t)[l-F(t)].|| 
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5.  Comparison  between 
respect  to  a  certain  metric 


r  (t)  and  r  (t).  We  shall  show  that  with 
n  -  n 

r  (t)  is  closer  to  r(t)  than  is  r  (t) , 
n  n 


where 


(5.1)  rn(t) 


0 

{(n- J)(Xj+1-Xj>}-1 


CO 


for  0  <  t  <  X^ 


for  X.  <  t  <  X.+1,  3  =  1,2 
for  X  <  t  <  ». 


,n-l 


Note  that  r  (t)  represents  the  "unaveraged"  estimate  of  the  failure  rate, 
n 

i.e. ,  the  estimate  that  does  not  take  into  account  the  requirement  that 
r(t)  be  increasing.  The  result  is  similar  to  an  inequality  of  [1,  page  6V+] 
and  is  really  a  special  case  of  the  results  of  [?].  We  give  a  simple  proof 
for  convenience  and  completeness. 


We  need  the  general  result : 

Theorem  5.1.  Let  h  be  nondecreasing,  g  be  integrable  with  respect 

to  the  measure  p ,  the  discontinuities  of  h  distinct  from  the  points  at 

which  p  places  positive  mass ,  and 

t 

J  g(e)dp(e) 

g(_x)  ,  sup  inf  8  (t).-'(37  • 
s<x  t>x 

Then 

(5.2)  f(g-h)2dp  >  ,f(g- h)2dp  +  f(g-g)2dp. 

Proof,  It  suffices  to  show  j*(g  -  h)  (g  -  g)dy.  ^  0,  The  x-axis  can  be 

broken  up  into  single  points  and  maximal  intervals  on  each  of  which  g(x) 

is  constant.  At  a  single  point  xt  g(x)  =  g(x).  Let  [a,b]  be  an  interval, 

x 

with  g(x)  =  g  on  Ca,b]#  Define  G(x)  =  J*  g(0)dp.(9).  Then 
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b  b 

j*  (g-h(x))(g(x) -g)dp(x)  =  J  [g(x)  -  G(a)  -  g[p(x)  -  u(a)  3  Jdf.h(x)  -  g}  >  0, 
a  a 


since 


G(x)  - 


>  inf 
«*>  *  t>x 


G(t)  -  G(a) 
h(t)  -  p(a) 


=  sup  inf 
s<x  t>x 


G(t)  -  G( s) 
p.(t)  -  v-(b) 


Identifying  h(t)  as  r(t),  g(t)  as  r^Ct)  ,  g(t)  as  rR(t) ,  and 
as  Fn(t),  the  usual  empirical  distribution,  we  obtain  from 
Theorem  5.1: 


Theorem  5.2.  With  probability  one, 

(5.3)  jV  (t)  -r(t)}2dF  (t)  >  JP{r  (t)-r(t)]2dFn(t)  +  f  [rj  t)  -  rn<t)  32dFn(t) 

-CD  11  -CO  -<» 

Thus,  in  the  sense  made  precise  by  (5.3),  rn(t)  cI°ser  bo  r(t) 

than  is  r  ( t)  . 
n 

6.  Decreasing  failure  rate.  A  distribution  F  is  said  to  have 
decreasing  failure  rate  (DFR)  if  the  support  of  F  is  of  the  form  [a,00), 
a  >  .a.i  and  if  logCl-F(x)]  is  convex  on  [a ,")  .  Such  distributions  arise, 
e.g„as  mixtures  of  exponentials  (see  [93). 

If  F  is  DFR  then  by  an  argument  similar  to  that  used  in  the  IFR  case, 

it  is  absolutely  continuous  except  possibly  for  a  discontinuity  at  the 

point  a.  Thus,  the  measure  determined  by  F  is  absolutely  continuous  with 

rescect  to  u  =6  +  A  where  6  places  unit  mass  on  (a]  and  A  is 

r  a  a  a 

Lebesgue  measure;  we  denote  the  density  of  F  with  respect  to  by  f, 

and  again  define  the  failure  rate  of  F  by  r(x)  =  f(x)/[l  -  F(x-) 3 .  If  F 
is  DFR,  we  always  take  a  version  of  f  for  which  r  is  decreasing  in  (a,“»). 

Allowing  for  the  fact  that  f  is  a  density  with  respect  to  Pa,  we  see 
that  (2.1)  is  replaced  by 
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(6.1)  l-F(x)  =  [1  -  r(a)]exp[-  J*  r(z)dz]. 

(a  ,x] 

Estimation  in  the  DFR  case  parallel  that  in  the  IFR  case,  but  with 
some  interesting  differences.  The  first  of  these  is  that  there  are  really 
two  problems  in  the  DFR  case  ,  depending  on  whether  or  not  the  point  a  is 
known. 

First  consider  the  case  that  a  is  known  and  suppose  a  =  X^  =  ••• 

=  <  X^+^  <  • *  *  <  Xn  (in  case  k  =  0,  we  define  XQ  =  a).  Using 

(6.1) ,  f(x)  =  r(x)[l  -  F(x-)]  and  the  relations  r(a)  =  f(a)  =  F(a+) ,  we 
write  the  log  likelihood  in  the  form 

X. 

n  n  i 

k  log  r(a)  +  (n-k)  log  (l-r(a))  +  E  log  r(X.)-  £  J  r(z)dz. 

i=k+l  1  i=k+l  a 

Maximization  of  the  first  two  terms  yields  rn(a)  =  k/n  =  F(a+).  Maximiza¬ 
tion  of  the  last  two  terms  is  quite  analogous  to  that  in  the  IFR  case,  and 
yields  for  r  the  estimator 

f  (x)  =  r  (X. ),  X.  <  x  <  X  ,  i  =  k+1 , . . . ,n 
n  n  l  x— x  —  l 

where 

rn(X^)  =  max  min  -[(n  -  u)(xu+1“  X,)  +  ***  +  (n  -  v  +  1)  (X^  -  Xv_^)  ]  }  , 

v>i  u<i-l 

and  Xq  =  a  in  case  k  =  0. 

Contrary  to  the  IFR  case,  this  DFR  estimator  is  not  unique;  it  is 
determined  by  the  likelihood  equation  only  for  x  <  X^,  and  may  be  extended 
beyond  X  in  any  manner  that  preserves  the  DFR  property. 
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Consider  now  the  case  that  a  Is  unknown,  and  assume  for  the  moment 
that  F  is  absolutely  continuous  with  respect  to  Lebesgue  measure.  If 
F  is  DFR  on  [a,®)  for  a  >  ,  then  the  likelihood  A(F)=  TTf^)  = 

If  F  is  DFR  on  [a,®)  for  a  <  X1 ,  then  A(F)  <  A(F)  where  F  is 
defined  by 


F(x)  = 


F(x)  -  F(X1) 


1  -  F(X1) 


x  >  X 


x  < 


Thus  the  maximum  likelihood  estimator  for  a  unknown  is  found  among  those 
DFR  distributions  with  support  [X]L ,®)  ,  and  the  problem  reduces  to  the  case 
of  known  a.  Note  that  the  estimator  F  has  a  jump  of  at  least  l/n  at 


X 


1* 


The  proof  of  consistency  in  the  DFR  case  is  similar  to  the  proof  in 
the  IFR  case. 

7#  The  discrete  case.  A  related  problem  of  interest  occurs  in  the 
case  that  F  is  discrete  IFR.  If  F  is  a  discrete  distribution  with  mass 
p±  at  x±,  i  =  ...,-1,0, 1,2,...  and  the  x±  are  ordered  increasingly, 
the  ratio 

CO 

p ,  =  p ./  £  P  •  ,  i  =  •  •  •  •  •  * 

0=i  J 

is  called  the  (discrete)  failure  rate  of  F.  If  r±  is  increasing,  then 
F  is  said  to  be  discrete  IFR.  It  is  easily  verified  that 
i-1 

p ,  =  p .  TT  (i_Pj)»  •••  » •  •  •  • 

1  1  i=_<s  J 
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If  a  sample  of  n  independent  observations  from  F  consists  of 
occurrences  at  x^ ,  where  for  notational  convenience,  i  =  then 

the  log  likelihood  function  is 
k  k 

L  =  S  mi  log  pi  =  £  {itk  log  +  (mi+1+- •  .+mk)log(l  -  p±)  }. 

i=l  i=l 

We  wish  to  maximize  L  subject  to  <  p^  <  • • •  <  p^. 

With  proper  identification,  this  problem  is  exactly  the  one  solved  in 
Cl].  The  solution  is  obtained  by  averaging  (through  adding  numerators  and 
denominators)  the  quantities 

0  ,  i  <  1 

(7.1)  p?  =  mi/(mjL+'- *+1^)  ,  i  =  l,2,..,k 

1,  i  >  k, 

to  eliminate  any  reversals  pt  >  p^+1.  After  sufficient  averaging  a  set 
of  increasing  estimates  p^,...,Pk  are  obtained  which  may  be  written  as 

m  +m  -  +  •••+  m 

(7.2)  p.  =  min  max  — - — - ^  , 

1  k>r>i  s<i  «  ,  . 

0=r  0 

The  estimator  given  in  Section  3  Tor  the  continuous  case  may  be  derived 
from  this  as  a  limiting  case.  Consistency  of  the  estimator  (7.2)  follows 
as  in  [1], 

If  p^  is  decreasing,  then  F  is  said  to  be  discrete  DFR.  In  this 
case,  maximum  likelihood  estimators  may  be  obtained  and  consistency  proved 
using  the  same  method  as  in  the  discrete  IFR  case  with  obvious  modifications. 
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